377 research outputs found

    String Synchronizing Sets: Sublinear-Time BWT Construction and Optimal LCE Data Structure

    Full text link
    Burrows-Wheeler transform (BWT) is an invertible text transformation that, given a text TT of length nn, permutes its symbols according to the lexicographic order of suffixes of TT. BWT is one of the most heavily studied algorithms in data compression with numerous applications in indexing, sequence analysis, and bioinformatics. Its construction is a bottleneck in many scenarios, and settling the complexity of this task is one of the most important unsolved problems in sequence analysis that has remained open for 25 years. Given a binary string of length nn, occupying O(n/logn)O(n/\log n) machine words, the BWT construction algorithm due to Hon et al. (SIAM J. Comput., 2009) runs in O(n)O(n) time and O(n/logn)O(n/\log n) space. Recent advancements (Belazzougui, STOC 2014, and Munro et al., SODA 2017) focus on removing the alphabet-size dependency in the time complexity, but they still require Ω(n)\Omega(n) time. In this paper, we propose the first algorithm that breaks the O(n)O(n)-time barrier for BWT construction. Given a binary string of length nn, our procedure builds the Burrows-Wheeler transform in O(n/logn)O(n/\sqrt{\log n}) time and O(n/logn)O(n/\log n) space. We complement this result with a conditional lower bound proving that any further progress in the time complexity of BWT construction would yield faster algorithms for the very well studied problem of counting inversions: it would improve the state-of-the-art O(mlogm)O(m\sqrt{\log m})-time solution by Chan and P\v{a}tra\c{s}cu (SODA 2010). Our algorithm is based on a novel concept of string synchronizing sets, which is of independent interest. As one of the applications, we show that this technique lets us design a data structure of the optimal size O(n/logn)O(n/\log n) that answers Longest Common Extension queries (LCE queries) in O(1)O(1) time and, furthermore, can be deterministically constructed in the optimal O(n/logn)O(n/\log n) time.Comment: Full version of a paper accepted to STOC 201

    Kurtosis-based detection of intracranial high-frequency oscillations for the identification of the seizure onset zone

    Get PDF
    Pathological High-Frequency Oscillations (HFOs) have been recently proposed as potential biomarker of the seizure onset zone (SOZ) and have shown superior accuracy to interictal epileptiform discharges in delineating its anatomical boundaries. Characterization of HFOs is still in its infancy and this is reflected in the heterogeneity of analysis and reporting methods across studies and in clinical practice. The clinical approach to HFOs identification and quantification usually still relies on visual inspection of EEG data. In this study, we developed a pipeline for the detection and analysis of HFOs. This includes preliminary selection of the most informative channels exploiting statistical properties of the pre-ictal and ictal intracranial EEG (iEEG) time series based on spectral kurtosis, followed by wavelet-based characterization of the time-frequency properties of the signal. We performed a preliminary validation analyzing EEG data in the ripple frequency band (80-250[Formula: see text]Hz) from six patients with drug-resistant epilepsy who underwent pre-surgical evaluation with stereo-EEG (SEEG) followed by surgical resection of pathologic brain areas, who had at least two-year positive post-surgical outcome. In this series, kurtosis-driven selection and wavelet-based detection of HFOs had average sensitivity of 81.94% and average specificity of 96.03% in identifying the HFO area which overlapped with the SOZ as defined by clinical presurgical workup. Furthermore, the kurtosis-based channel selection resulted in an average reduction in computational time of 66.60%

    Finding the Anticover of a String

    Get PDF
    A k-anticover of a string x is a set of pairwise distinct factors of x of equal length k, such that every symbol of x is contained into an occurrence of at least one of those factors. The existence of a k-anticover can be seen as a notion of non-redundancy, which has application in computational biology, where they are associated with various non-regulatory mechanisms. In this paper we address the complexity of the problem of finding a k-anticover of a string x if it exists, showing that the decision problem is NP-complete on general strings for k ? 3. We also show that the problem admits a polynomial-time solution for k=2. For unbounded k, we provide an exact exponential algorithm to find a k-anticover of a string of length n (or determine that none exists), which runs in O*(min {3^{(n-k)/3)}, ((k(k+1))/2)^{n/(k+1)) time using polynomial space

    Changes in the Splenic Melanomacrophage Centre Surface Area in Southern Bluefin Tuna (Thunnus maccoyii) Are Associated with Blood Fluke Infections

    Get PDF
    Melanomacrophage centres (MMCs) are aggregates of macrophages accumulating various pigments. They have been proposed as an indicator of fish immune response. Blood flukes are common parasites in farmed fish. Two cohorts of wild Southern Bluefin Tuna (Thunnus maccoyi) were examined at transfer, before treatment against blood flukes (pre-treatment) and at harvest. MMCs were assessed in histological sections using image analysis, while Cardicola forsteri and Cardicola orientalis infection severity was determined using qPCR, count of adult flukes in heart flushes and count of eggs in gill filaments. Fish from both cohorts showed the same pattern in the changes in the surface area of MMCs. The surface area of splenic MMCs increased over the ranching duration and was positively correlated to the PCR determined copy numbers of Cardicola forsteri ITS2 rDNA in the gills of those fish. However, the infection with blood fluke was more variable, both between cohorts and individuals within the same cohort. Eggs of blood fluke were detected in renal MMCs using histology. Cardicola forsteri had a higher prevalence than Cardicola orientalis. This study contributes to our understanding of blood fluke infections in Southern Bluefin Tuna and their interactions with MMCsS

    EPINETLAB:a software for seizure-onset zone identification from intracranial EEG signal in epilepsy

    Get PDF
    The pre-operative workup of patients with drug-resistant epilepsy requires in some candidates the identification from intracranial EEG (iEEG) of the seizure-onset zone (SOZ), defined as the area responsible of the generation of the seizure and therefore candidate for resection. High-frequency oscillations (HFOs) contained in the iEEG signal have been proposed as biomarker of the SOZ. Their visual identification is a very onerous process and an automated detection tool could be an extremely valuable aid for clinicians, reducing operator-dependent bias and computational time. In this manuscript we present the EPINETLAB software, developed as a collection of routines integrated in the EEGLAB framework that aim to provide clinicians with a structured analysis pipeline for HFOs detection and SOZ identification. The tool implements an analysis strategy developed by our group and underwent a preliminary clinical validation that identifies the HFOs area by extracting the statistical properties of HFOs signal and that provides useful information for a topographic characterization of the relationship between clinically defined SOZ and HFO area. Additional functionalities such as inspection of spectral properties of ictal iEEG data and import and analysis of source-space MEG data were also included. EPINETLAB was developed with user-friendliness in mind to support clinicians in the identification and quantitative assessment of HFOs in iEEG and source space MEG data and aid the evaluation of the SOZ for pre-surgical assessment

    Patient-specific detection of cerebral blood flow alterations as assessed by arterial spin labeling in drug-resistant epileptic patients

    Get PDF
    Electrophysiological and hemodynamic data can be integrated to accurately and precisely identify the generators of abnormal electrical activity in drug-resistant focal epilepsy. Arterial Spin Labeling (ASL), a magnetic resonance imaging (MRI) technique for quantitative noninvasive measurement of cerebral blood flow (CBF), can provide a direct measure of variations in cerebral perfusion associated with the epileptic focus. In this study, we aimed to confirm the ASL diagnostic value in the identification of the epileptogenic zone, as compared to electrical source imaging (ESI) results, and to apply a template-based approach to depict statistically significant CBF alterations. Standard video-electroencephalography (EEG), high-density EEG, and ASL were performed to identify clinical seizure semiology and noninvasively localize the epileptic focus in 12 drug-resistant focal epilepsy patients. The same ASL protocol was applied to a control group of 17 healthy volunteers from which a normal perfusion template was constructed using a mixed-effect approach. CBF maps of each patient were then statistically compared to the reference template to identify perfusion alterations. Significant hypo- and hyperperfused areas were identified in all cases, showing good agreement between ASL and ESI results. Interictal hypoperfusion was observed at the site of the seizure in 10/12 patients and early postictal hyperperfusion in 2/12. The epileptic focus was correctly identified within the surgical resection margins in the 5 patients who underwent lobectomy, all of which had good postsurgical outcomes. The combined use of ESI and ASL can aid in the noninvasive evaluation of drug-resistant epileptic patients

    A case of clinical worsening after stereo-electroencephalographic-guided radiofrequency thermocoagulation in a patient with polymicrogyria

    Get PDF
    : Radiofrequency thermocoagulation (RF-TC) is a wide-used procedure for drug-resistant epilepsy. The technique is considered safe with an overall risk of 1.1% of permanent complications, mainly focal neurological deficits. We report the case of a patient with drug-resistant epilepsy who complained of immediate seizure worsening and an unexpected event seven months following RF-TC. A 35-year-old male with drug-resistant epilepsy from the age of 18 years underwent stereoelectroencephalography (SEEG) implantation for a right peri-silvian polymicrogyria. He was excluded from surgery due to extent of the epileptogenic zone and the risk of visual field deficits. RF-TC was attempted to ablate the most epileptogenic zone identified by SEEG. After RF-TC, the patient reported an increase in seizure severity/frequency and experienced episodes of postictal psychosis. Off-label cannabidiol treatment led to improved seizure control and resolution of postictal psychosis. Patients with polymicrogyria (PwP) may present with a disruption of normal anatomy and the co-existence between epileptogenic zone and eloquent cortex within the malformation. RF-TC should be considered in PwP when they are excluded from surgery for prognostic and palliative purposes. However, given the complex interplay between pathological and electrophysiological networks in these patients, the remote possibility of clinical exacerbation after RF-TC should also be taken into account

    Multicenter clinical assessment of improved wearable multimodal convulsive seizure detectors

    Get PDF
    Objective New devices are needed for monitoring seizures, especially those associated with sudden unexpected death in epilepsy (SUDEP). They must be unobtrusive and automated, and provide false alarm rates (FARs) bearable in everyday life. This study quantifies the performance of new multimodal wrist-worn convulsive seizure detectors. Methods Hand-annotated video-electroencephalographic seizure events were collected from 69 patients at six clinical sites. Three different wristbands were used to record electrodermal activity (EDA) and accelerometer (ACM) signals, obtaining 5,928 h of data, including 55 convulsive epileptic seizures (six focal tonic–clonic seizures and 49 focal to bilateral tonic–clonic seizures) from 22 patients. Recordings were analyzed offline to train and test two new machine learning classifiers and a published classifier based on EDA and ACM. Moreover, wristband data were analyzed to estimate seizure-motion duration and autonomic responses. Results The two novel classifiers consistently outperformed the previous detector. The most efficient (Classifier III) yielded sensitivity of 94.55%, and an FAR of 0.2 events/day. No nocturnal seizures were missed. Most patients had <1 false alarm every 4 days, with an FAR below their seizure frequency. When increasing the sensitivity to 100% (no missed seizures), the FAR is up to 13 times lower than with the previous detector. Furthermore, all detections occurred before the seizure ended, providing reasonable latency (median = 29.3 s, range = 14.8–151 s). Automatically estimated seizure durations were correlated with true durations, enabling reliable annotations. Finally, EDA measurements confirmed the presence of postictal autonomic dysfunction, exhibiting a significant rise in 73% of the convulsive seizures. Significance The proposed multimodal wrist-worn convulsive seizure detectors provide seizure counts that are more accurate than previous automated detectors and typical patient self-reports, while maintaining a tolerable FAR for ambulatory monitoring. Furthermore, the multimodal system provides an objective description of motor behavior and autonomic dysfunction, aimed at enriching seizure characterization, with potential utility for SUDEP warning
    corecore